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Abstract. On complete, non-compact manifolds and infinite graphs, Faber- 
Krahn inequalities have been used to estimate the rate of decay of the heat 
kernel. We develop this technique in the setting of finite Markov chains, prov- 
ing upper and lower L°° mixing time bounds via the spectral profile. This ap- 
proach lets us recover and refine previous conductance-based bounds of mixing 
time (including the Morris- Peres result), and in general leads to sharper esti- 
mates of convergence rates. We apply this method to several models including 
groups with moderate growth, the fractal-like Viscek graphs, and the product 
group Tj a X Z;,, to obtain tight bounds on the corresponding mixing times. 



1. Introduction 

It is well known that the spectral gap of a Markov chain can be estimated 
in terms of conductance, facilitating isoperimetric bounds on mixing time (see 
[SJ891 ILS88j ). Observing that small sets often have large conductance, Lovasz 
and Kannan f |LK99| ) strengthened this result by bounding total variation mixing 
time for reversible chains in terms of the "average conductance" taken over sets 
of various sizes. Morris and Peres (MP ) introduced the idea of evolving sets to 
analyze reversible and non-reversible chains, and found bounds on the larger L°° 
mixing time. 

To sidestep conductance, we introduce "spectral profile" and develop Faber- 
Krahn inequalities in the context of finite Markov chains, bounding mixing time di- 
rectly in terms of the spectral profile. FK-incqualities were introduced by Grigor'yan 
and developed together with Coulhon and Pittet ( [Ori94l ICouT)6l ICGP01I iBCOOlp 
to estimate the rate of decay of the heat kernel on manifolds and infinite graphs. 
Their techniques build on functional analytic methods presented, for example, in 
|Dav90| . We adapt this approach to the setting of finite Markov chains and derive 
i°° mixing time estimates for both reversible and non-reversible walks. 

These bounds let us recover the previous conductance-based results, and in gen- 
eral lead to sharper estimates on rates of convergence to stationarity. We also show 
that the spectral profile can be bounded in terms of both log-Sobolev and Nash 
inequalities, leading to new and elementary proofs for previous mixing time results 
- for example, we re-derive Theorem 3.7 of Diaconis-Saloff-Coste |DSC96a] and 
Theorem 42 (Chapter 8) of Aldous-Fill jAF] , 



In terms of applications, we first observe that for simple examples such as the 
random walk on a complete graph and the n-cycle, the spectral profile gives the 
correct bounds. As more interesting examples, we also analyze walks on graphs 
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with moderate growth, the fractal-like Viscek graphs, product groups like Z a x Zb, 
and show optimal bounds. In the case of the graphs with moderate growth, we 
show that the mixing time is of the order of the square of the diameter, a result 
originally due to Diaconis and Saloff-Coste (see |DSC94I IDSC96b] l. In the case of 
the Viscek graphs, we show that the spectral profile provides tight upper and lower 
bounds on mixing time, and observe that the conductance-based bounds give much 
weaker upper bounds. 

In Section n we introduce notation, review preliminary ideas and state our main 
results. Section |3 presents the proofs of both the continuous and discrete time 
versions of the spectral profile upper bound on mixing time. In Section[3]we recall 
a complementary lower bound shown in LGLGPQ 11 ■ Section 01 discusses applications, 
including the relationship between the spectral profile, log-Sobolev and Nash in- 
equalities. Section fOl discusses the more elaborate example of the Viscek graphs. 
Section T4.4I discusses the spectral profile of the random walk on Z a x Zf,, which 
turns out to be a bit subtle. 

1.1. Preliminaries. A Markov chain on a finite state space X can be identified 
with a kernel K satisfying 

K(x,y)>0 Y,K(x,y) = l. 

The kernel of K n is then given iteratively by K n (x, y) = K n -i(x, z)K(z, y) and 

can be interpreted as the probability of moving from state x to y in exactly n steps. 
We say that a probability measure 7r on X is invariant with respect to K if 
n(x)K{x 1 y) — 7r(y). That is, starting with distribution ir and moving according 

to the kernel K leaves the distribution of the chain unchanged. Throughout, we 
assume that K is irreducible: For each x, y S X there is an n such that K n (x, y) > 0. 
Under this assumption K has a unique invariant measure 7r and 7r„ = min^ n(x) > 0. 

The chain (K, ir) is reversible if, K = K* is a self-adjoint operator on the Hilbert 
space L 2 (tt). In general, K*(x, y) = ir(y)K(y, x)/ir(x), and so reversibility is equiv- 
alent to requiring that K satisfy the detailed balance equation: for all x, y £ X, we 
have i:{x)K(x,y) ~ ir(y)K(y,x). 

The kernel K describes a discrete-time chain which at each step moves with dis- 
tribution according to K . Alternatively, we can consider the continuous-time chain 
H t , which waits an exponential time before moving. More precisely, as operators 

H t = e~ tA A = I - K. 

The kernel of H t is then given explicitly by 

H t (x,y)=e~ t Y,-K n (x,y). 

n=0 

Let h(x,y,t) — Ht(x : y)/ir(y) denote the density of H t (x,-) with respect to its 
stationary measure 7r. 

To measure the rate of convergence to equilibrium, we first need to decide on a 
distance. 
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Definition 1.1. For two measures n and v with densities fix) — n(x)/ir(x) and 
g(x) = v(x)/tt(x) with respect to the positive measure n, their L p (ir) distance is 

d P ,n(n, v) = ||/ - g\\Lp(7r) for 1 < p < oo. 

For p = 1, this is twice the usual total variation distance. Furthermore, by 
Jensen's inequality, the function p i— > d Pj7r (/x, z/) is non-decreasing. 

Definition 1.2. The L p mixing time r p (e) for the continuous time chain with 
kernel H t {x,y) and stationary distribution tt is given by 



T p (e) = inf It > : sup d p ^(H t (x, •), 7r) < e 
I xex 

Our main result bounds the L°° mixing time (e) , also known as the e- uniform 
mixing time. Explicitly, 



T oo(e) = inf { t > : sup 

x.yeX 



H t (x,y) - n(y) 



< e 



To estimate mixing time, we prove lower bounds on the Dirichlet form associated 
to the walk. 

Definition 1.3. The Dirichlet form associated to K is 

SkV, 9) = J2 A /( x ) ' Six) tt(x) - (A /, g) v 

x£X 

where A = I — K and (•, -) w is the standard inner product for L 2 (ir). 
In particular, 

(1.1) £kUJ) = \ J2 [f(x)-f(y)?K(x,y)n(x). 

x,y£X 

Furthermore, £k{Ii f) = £fc* (/, /) = £ k+k* (/, /), which follows from equation 

2 

(11.11) and the identity K*(x,y)ir(x) = K(y,x)ir(y). Fix x £ X and set u x , t (y) — 
h(x,y,t). Then recall that 

(1.2) — Var(K X)t ) = ^2 -^ u l,t dn = _2 ^2 U ^,t^*u x ,td-K = -2£(u Xit , u x , t )- 

v v 

This argument motivates the standard definition of the spectral gap 

^ = inf 

/ Var(/) 

and the well known mixing time bounds using the spectral gap: 

(1.3) r 2 {e) < -^-\og—^— and T^l/e) < ^- ( 1 + log - 

Ai cWtu Ai V 7 



Note that by the Courant-Fischcr minmax characterization of eigenvalues, Ai is the 
second smallest eigenvalue of the symmetric operator (A + A*)/2. 
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1.2. Statement of the Main Result. Our main result bounds the L°° mixing 
time of a chain through eigenvalues of restricted Laplace operators. 

Definition 1.4. For a non-empty subset S C X, define 

\(S)= inf 

/ec+(S) Var(/) 

where Cq(S) = {/ : supp(/) C S, / > 0, / / constant}. 

In the reversible case, 

(1-4) A (g)<A(5)< i _ 1 A„(S) 

where Ao is the smallest eigenvalue of the restricted Laplacian Ag : Co(S) — > Cq(S) 
with Co (5) = {/ : supp(/) C S} and 



Asf(x) = ( Afix) xeS 
sJW \ x^S 



The kernel of Ag = 7 — _R"g is given explicitly by 

K*(x v) = l K{X ' V) X ' yeS 
s[ ,V> \ otherwise 

By the Courant-Fischer minmax characterization of eigenvalues, 11.4J1 is equivalent 
to the statement: 

. . Sk.UJ) / • r £K(f,f) , i . , £ Ks (fJ) 

mf — iT7TT2 — - mf T7 77n ^ i mf — iT7TT2 — 

/eco(S) H/lll /ec + ( 5) Var(/) 1 - ir(S) fec (S) \\f\\ 2 2 

The lower bound is due to the identity £x{f, f) = £k s {fi f) when / 6 cq(S), which 
follows from Af(x) — Asf(x) when / £ cq(S) and x £ S. The upper bound also 
requires the inequality {x-yf > (\x\-\y\) 2 to show that S(f, f) > |/|), while 
Cauchy-Schwartz gives |j/||i < \\fh\/^W) which implies Var(/) > {l-n(S))\\f\\l 
In general, when ir(S) < 1/2 then X(S) is within a factor two of the smallest 
eigenvalue of the symmetric operator (As + A* s )/2. 

We are interested in how X(S) decays as the size of S increases. 

Definition 1.5. Define the spectral profile A : [7r„, oo) — > R by 

AW= inf A(5). 

7r*<7r(S)<r 

Observe that A(r) is non-increasing, and A(r) > Ai. For r > 1/2, Lemma [2.21 
shows that A(r) is within a factor two of the spectral gap Ai. Furthermore, by 
construction the walk (K, n) satisfies the Faber-Krahn inequality 

X(S) > A(tt(5)) VScX. 

Theorem II. II is our main result: 

Theorem 1.1. For e > 0, the L°° mixing time T oa {e) for a chain H t (x,y) satisfies 

2dv 



,(*)< 



4tt, 



vA(v) ' 
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In Sect ion l2~31 we prove an analogous result for discrete-time walks. Since A(r) > 
Ai, Theorem 11.11 shows that 



4e 2dv 2 ( , 1 



Too(l/e)< / -T7-r<T- 1 + log — 

But since we can expect A(r) 3> Ai for small r, Theorem II . II offers an improvement 
over the standard spectral gap mixing time bound Ijl.^jl . In particular, by a discrete 
version of the Cheeger inequality of differential geometry, 

$l{r)/2 < A(r) < 2*»(r) 

where <f>*(r) is the (truncated) conductance profile (see Section |2~2"|) . Consequently, 
by Theorem ll.il 

Corollary 1.1. Fore > 0, the L°° mixing time Too(e) for a chain H t (x,y) satisfies 

^ Adv 



Lit, 

Theorem 13 of Morris and Peres |MP| is a factor two weaker than this. 

Although Theorem ll.il implies mixing time estimates in terms of conductance, 
it is reasonable to expect that for many models A(r) 3> &*{r). In these cases, com- 
pared to Corollarv ll.il presently the best known conductance bound, our spectral 
approach leads to sharper mixing time results. We provide below examples of such 
cases (see Sections 14. 11 and ET3)l . 

2. Upper Bounds on Mixing Time 

2.1. Spectral Profile Bounds. In this section, we prove one of the main results, 
Theorem ll.il The proof uses the techniques of Gri94 for estimating heat kernel 
decay on non-compact manifolds. The first Dirichlet eigenvalue Ao(S') for small 
sets 5* captures the convergence behavior at the start of the walk, when the fact 
that the state space is finite has minimal influence. The spectral gap Ai governs 
the long-term convergence. The spectral profile A(r) takes into account these two 
effects, since X(S) « X (S) for tt(S) < 1/2, and A(r) « Ai for r > 1/2. 

To bound mixing times, we first lower bound £{f,f) in terms of the spectral 
profile A, and as such Lemma ^.ll is the crucial step in the proof of Theorem ll.il 

We regularly use the notation that, given a function /,/+ = / V denotes its 
positive part, and /_ = —(/A 0) its negative part. 

Lemma 2.1. For every non- constant function u : X \— > R + , 

£{u,u) > l k (A{^u 



/4(Eu) 2 \ 
V Varu / 



Var u 2 

Proof. For c constant, £(u, u) = £(u—c,u—c). Also, Va, b £ R : (a— b) 2 > (a + —b + y 
so £ (/, /) > £(f+, /+)• It follows that when < c < max it then 

£(u,u) > £{{u — c)+, (u — c)+) 

> Var((u-c) + ) inf 

/gc+(«> c ) Var(/) 

> Var((u - c) + ) A(7r(u > c)) . 
Now, Va, b > : (a — b) 2 ^ > a 2 — 2b a and (a — b) + < a so 

Var((u - c)+) = E(u - c)\ - (E(u - c)+) 2 > Eu 2 - 2c Em - {Eu) 2 . 
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Let c = Var(w)/4Eu and apply Markov's inequality n(u > c) < (Eu)/c, 



8(u, u) > (Var(u) - 2cEu) A(Eu/c) = - Var(u) A 



4(Eti) 2 
Var u 



□ 



Now we bound the L 2 distance of a chain from equilibrium in terms of the 
function V(t) : [0, oo) — > R given by 

r v(t) dv 

1 ~ Jin, VA(V) ' 

Since the integral diverges, V(t) is well-defined for t > 0. 

The L 2 bound of Theorem l2.1l implies the L°° bound that is our main result. To 
prove the L 2 bound, we simply apply Lemma 12. II to the heat kernel h(x,y,t). 

Theorem 2.1. For the chain (K,ir), we have 

4 

Bwpd% (H t (x,-),ir) < ——. 
xex Vyt) 

Proof. Given x € X a value where the supremum occurs, define u Xl t(y) = h(x, y, t) 
and I x (t) — Yax(u x ,t)' If Ux,t = 1 then the theorem follows trivially. Otherwise, 
u x> t is non-constant and since Eu^j = 1, then by (|1.2I) and Lemma 12. 11 

(2.1) I' x {t) = -2£{u x , u ux,t) < -4A(4/4). 

Integrating over [0, t] we have 

_*_<_«. 

yj.(0) 4A(4/J X ) 
With the change of variable v = 4/I x , 



i < 



4/7. (0) 



uA(u) ' 



Since I x (0) = l/7r(y) - 1 < 1/tt* 



n<)< 



and the result follows. 



□ 



Now we show how to transfer the L 2 bounds of Theorem l2.1l to the L°° bounds 
of our main result. 

Proof of Theorem ll.il Observe that 



H t (x,y) - n(y) 



Ez ( H t/2(x, z) - tt{z)) (H t/2 (z, y) - tt(j/)) 



i"(y) 



H* /2 (y,z) 



(2.2) 



f H t/2 (x,z) \ (H t/ , 
< d 2 ^(H t/2 (x, •), 7r) d 2 , 7T (H* /2 (y, •), tt) 



- 1 
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where the inequality follows from Cauchy-Schwartz. Since we can apply Theo- 
rem 12.11 to either Ht or H£ , we have 

4 

sup \h{x,y,t) - 1| < ttt-j-t ■ 

So \h(x,y,t) - 1| < e for V(t/2) > 4/e, that is, for t such that 

r 4 / £ dv 



t/2 > 



r< vA(v) 



proving the result. □ 

The next result shows that any improvement in using the spectral profile A(r) 
instead of the spectral gap Ai comes from looking at small sets since for r = 1/2, 
already A(r) w Ai. 

Lemma 2.2. The spectral gap Ai and the spectral profile A(r) satisfy 

Ai < A(l/2) < 2Ai. 

Proof. The lower bound follows immediately from the definition of the spectral gap. 
For the upper bound, let to be a median of /. Then using Lemma 12.31 

£(/,/) = £(f-m,f-m) 

> £{{f - m)+, (/ - m)+) + £((f - m)_, (/ - m)_). 
Since 7r({/ > to}) = 7r({/ < to}) < 1/2, we have 

£{{/ - m)+, (/ - m)+) > ||(/ - to)+||| A ({/ > to}) 

and 

£((/-m)_,(/-m)_) > ||(/-m)_||lAo({/<m}) 

Consequently, 

£(f,f) > \\f-m\\t inf , X (S) 



> Var(/) 



■(S)<l/2 

A(l/2) 



2 

The upper bound follows by minimizing over /. □ 

The proof required the following lemma. 
Lemma 2.3. Given a function f : X t— > R then 

£(f, f) > £(/+,/+) + £(f-J-) > £(\f\, l/l). 

Proof. Given <?, h : X i— > R with g,h > and (supp g) f~l (supp h) — $ then 

because the first sum is zero and every term in the second is non-negative. In 
particular, /+,/- > with (supp/+) n (supp/_) = 0, and so by linearity 

£(f,f) = £(f+-f-J+-f-) 

= £(f + , /+) + £(f-J-) - £(f+,f-) - £(J-,f+) 

> £(/+,/+)+£(/_,/_) 

> £ (/+, /+) + £(f-J-) + £(f+,f-) + £(f-,f+) 

= £(\f\Af\)- 
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□ 

2.2. Conductance Bounds. In this section, we show how to use Theorem 1 1.1 1 to 
recover previous bounds on mixing time in terms of the conductance profile. 

Definition 2.1. For non-empty A,BcX, the flow is given by 

Q(A,B)= ]T Q{x,y) 

where Q{x,y) = n{x)K{x,y) can be viewed as a probability measure on X x X. 
The boundary of a subset is defined by 

dS = {x e S : By S, K(x, y) > 0} 

and \dS\ = Q(S,S C ). 
Observe that 

tt(S) = Q(S, X) = Q(S, S) + Q(S, S c ) 

and also 

n(S) = Q(X, S) = Q(S, S) + Q(S C , S). 

It follows that Q(S, S c ) = Q(S C , S). 

Like the spectral profile A(r), the conductance profile <&(r) measures how con- 
ductance changes with the size of the set S. 

Definition 2.2. Define the conductance profile $ : [71"*, 1) — > R by 

1 / \ . . \dS\ 

7r»<7r(5)<r 7t(S) 

and the truncated conductance profile : [7T*, 1) — > R by 

$(r) r<l/2 



$(1/2) r>l/2 



The value <I>(l/2) is often referred to as the conductance, or the isoperimetric 
constant, of the chain. 

The next lemma is a discrete version of the "Cheeger inequality" of differential 
geometry, and will let us apply Theorem 11.11 to recover the conductance profile 
bound of Corollary 11.11 The proof of the lemma is similar to the proof given in 
|SC96| of the fact that 

< Xi < 2*(l/2). 

Lemma 2.4. For r G [it*, 1), the spectral profile A and the conductance profile $ 
satisfy 

$2 W a /„\ ~* *(r) 



< A(r) < 



1 



Proof. It suffices to show that \ $ 2 (tt(A)) < X (A) < ^ for every Ad X. The 
bound then follows from p. 4(1 by minimizing over sets with tt(A) < r. 
For the upper bound, 

X (A) < £ 4^> = 441. 
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To show the lower bound, for a non-negative function /, define the level sets 
Ft = {x E X : f(x) > t] and the indicator functions ft — l_F t . Then 



E (J™ ft{x)dt ) Ax) 

ir(F t )dt. 



(2-3) Hf) = 



Furthermore, 

x,y x,y 

= E [/(*)- /(j/)][Q(*>v) + Q(im 

f(x)>f(y) 



I'OC 

E / 1 U(y)<t<f(x)}[Q(x,y) + Q{y,x)]dt 
f(x)>Hv) 

poo pOO 

/ \dF t \dt+ / \dF t c \dt 
Jo Jo 



/•oc 

(2.4) = 2 / \dF t \dt. 

Jo 

Observe that 1)2. 4|l is a discrete analog of the co-area formula. For non-negative 
/ E Co (A), F t d A for i > 0, and so 

/>oo 

> 2$(tt(A)) / Tr(F t )dt 
Jo 

= 2$(k(A)W) by(E3|). 
Consequently, for any non-negative / G cq(j4), by the above 
2$( 7 r(A)) 7 r(/ 2 ) < ^|/ 2 (x)-/ 2 (t/)|Q(x,y) 



< 



Then 



53 •(/(*) + /(»)) 

[53(/(x)-/(j/)) 2 Q(x,y)] 

x [53(/(a ; ) + /( y )) 2 Q(a ; ,j/)] 
< (2£(/,/)) 1/2 (4tt(/ 2 )) 1 /2. 

Ao(A) = mf _ > 



/ec+(A) tt(/ 2 ) 2 
The infinum for Ao(^4) occured at f € Cq (j4) because for general / € Co(A), 
f(/,/)>f(|/|,|/|)and7r(/ 2 )=7r(|/| 2 ). □ 
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Remark 2.1. From the proofs of Lemma \l. 41 and Lemma \l. 21 we have that 

$ 2 (l/2) 

^±L£l < inf \ (A) < Ai. 

2 tt(A)<1/2 

Consequently, when r > 1/2 then $J(r)/2 < Ai < A(r), proving the conductance 
profile bound of Corollarv ll.il 

2.3. Discrete-Time Walks. In this section we consider discrete-time chains, de- 
riving spectral profile bounds on mixing time similar to those for continuous-time 
walks. For u x .t(y) = h(x, y, i), the rate of decay of the heat operator in the contin- 
uous setting is given by 

(2.5) —Vax(u x , t ) = -2£(u x ,u u x , t )- 

In the discrete-time setting, set u x ^ n (y) = k(x,y,n) — ■ Then, since 

K*Ux,n = u x . n+ i and E(u x . n ) = 1, 

(2.6) Var(tt X)n+ i) - Ya,v(u x ^ n ) = (u Xi „ + i, u XjTl+ \) - {u x>n , u x<n ) 

= -((I - KK*)u Xi „,u Xtn ) 

= —£KK*(u x ,n,U x , n ) 

and so it is natural to consider the multiplicative symmeterizations KK* and K* K. 
In order to relate mixing time directly to the kernel K of the original walk, we use 
the assumption that for a > 

K(x,x)>a Vx G X. 

Define Akk* and Vkk* to be the analogs of A and V where £k{J: f) is replaced 
by Ekk* {f, /)• If KK* is reducible, then Af K = 0, and so we restrict ourselves 
to the irreducible case. We define Ak*k and Vk*k similarly and also assume 
irreducibility. The following result is a discrete-time version of Theorem 12.11 and 
its proof is analogous. 

Theorem 2.2. For a discrete-time chain {K, tt) with K* K and KK* irreducible 

sup dl (K n (x, •), tt) < — — r and sup d% {K*{x, ■), tt) < — — . 

x ex V K K'{n/2) xeX V K »K{n/2) 

Proof. The second statement follows from the first by replacing K by K * . For fixed 
x G X, define u xn (y) = k(x,y,n) and I x (n) = Vav(u Xtn ). By l|2.6|l and Lemma 12. II 

I x (n + 1) - I x (n) = -£KK*{u Xl n,U x , n ) 

< --I x {n)A KK ,(4/I x (n)). 

Since both I x {n) and Akk* {t) are non- increasing, the piecewise linear extension of 
I x (n) to R + satisfies 

4(i)<-~/*(t)A KK .(4// x (t)). 

At integer t, we can take either the derivative from the right or the left. Solving 
this differential equation as in Theorem l2.ll we have 

and the result follows. □ 
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Corollary 2.1. Assume that K(x,x) > a > for all x € X . Then for e > 0, the 
L°° mixing time for the discrete-time chain K n satisfies 

r 4 ^ dv 



Too(e) < 2 



47Tr avA(v) 



Proof. Since K*{x, x) — K{x, x) > a, observe that 

KK*(x,y)ir(x) > K*(x, x)K(x, y)ir(x) + K*{x, y)K(y, y)ir{x) 
> aK(x, y)ir(x) + aK{y, x)n(y) 

and so, 

£KK*(fJ)>2a£ K (f,f). 
Consequently, Akk * > 2oA, from which it follows that 

rV(at) dy 



at = 



> 2a 



4tt. vA (v) 
V W dv 



/ 47r< _ vA KK *(v) 

Accordingly, V K K*(t/2) > V(at), and similarly V K *K{t/2) > V(at). As in Theo- 
rem ^3 

\k(x,y,2n) - 1| < d2,iv(K n (x,-),ir)d2,ir(K*(y,-),n) 
4 



< 



And so, \k(x,y,2n) — 1| < e for 



n > 



V(an) 
^ dv 



4,7, avA{v)' 



□ 



Improvement for discrete-time using rescaling. Given a Markov kernel K let 
A K (r) and <& K (r) denote the spectral and conductance profiles, respectively. Then 

A K (r) = (l-a)Ai^(r) > I^$^( r )2 
I- a (<5> K {r)\ 2 _ <^> K (r) 2 



1 -a 



2(1 - a) 



The appropriate discrete time version of Corollarv ll.il is then 
Corollary 2.2. For e > 0, the L°° mixing time T oc (e) for the chain K n satisfies 

f^ e 2dv 

>(e) < 2 



In contrast, the bound of Morris and Peres |MP| is 
Too(e) < 2 



'4tt» m in| lr si F ,l| v$l(v) 
which is similar, but slightly weaker when a 1/2. 
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3. Lower Bounds on Mixing Time 

In this section, we recall a result of |COP01| to show that for reversible chains 
the spectral profile describes well the decay behavior of the heat kernel h t (x,y) — 
H t (x, y)/ft{y)- These results are based on the idea of anti-Faber-Krahn inequalities. 

For reversible chains, l|2.2|) implies 

x,y ~{y) x z ^ ~\ z ) J x ~{x) 

and so 

sup h t (x, y) = sup h t (x 7 x) . 

Lemma 13. II gives a simple lower bound on the heat kernel. 

Lemma 3.1 ( CGP01 ). For a reversible chain (K,tt) and non-empty S C X, 

, / x . exp(~tX (S)) 
sup h t {x,x) > — — . 

x£X ZTT{b) 

Proof. Let A (S) < Xi{S) < ■ ■ ■ < X\ S \-i(S) be the eigenvalues of I - K s . Then 
Ks has eigenvalues {1 — Aj(S)}. Since tr(JT|) can be written as either the sum of 
eigenvalues, or the sum of diagonal entries, we have 

\S\-1 

£(l-A 4 (S)) fc = X^&z,*) 

x£S 

For k even, all the terms in the first sum are non-negative, and consequently 

(l-\v(S)) k <Y,Kk(x,x). 

x£S 

Finally, to bound the continuous-time kernel, note that 



tt(S) sup ht(x, x) > ht(x, x)tt(x) 



x£X 



xGS 

°° fik 

* E e "*E(2jfe)!^ fc (x s x) 

x£S k=0 y ' 



>- e "'E 



— t 2k (l-X (S)) 2k 



exp[i(l - A (5))] + exp[-t(l - X (S))} 



from which the result follows. □ 

Theorem l3.1l is a partial converse of the upper bound given in Theorem l2 . 1 1 under 
the restriction of 5-regularity. 

Definition 3.1. A positive, increasing function / £ C 1 (0,T) is (5-regular if for all 
0<t<s<2t<T 

f'(s) f'(t) 
f(s) ~ /(*) " 
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Definition 3.2. The walk (K, tt) satisfies the anti-Faber-Krahn inequality with 
function L : [tt*, 00) — > R if for all re [V*, 00), 

inf A„(S) < L(r). 

7r^<7r(5)<r 

Remark 3.1. Observe that (K, tt) satisfies the anti-Faber-Krahn inequality with 
L(r) = A(r), in light of ifTljl . 

Theorem 3.1 ( CGP0l|). Let (K,tt) be a reversible Markov chain that satisfies the 
anti-Faber-Krahn inequality with L : (n^^oo) — > R, and that ^(t), defined implicitly 
by 

n {t) dv 

vL(v)' 

is 5-regular on (0, T). Then for t € (0, ST/2) 

suph t (x,x) > JrrTFs - 

Proof. Fix t € (0, 8T/2) and set r = j(t/S). By the anti-Faber-Krahn inequality 
there exists S C X with 7r(5) < r and Ao(S') < L{r). Consequently by Lemma I5~TI 

u I \ ^ exp(-U (S')) exp(-£L(r)) 

sup ^(2:, a:) > —j^- > . 

xex 2tt(5) 2r 

So, sup,,, ht(x,x) > exp(— C t ) for C t = log2r + tL(r). Since L(7(s)) = (log7)'(s) 

C t = \og2 1 (t/6) + t(\og 1 y(t/6). 

By the mean value theorem, there exists 9 £ (t/6, 2t/S) such that 

(log7) / (g) = 10g7W ~ 1 ° S7( ^ ) - 

By 5-regularity 

(log 7 )'(0) > <5(log 7 )'(VS) 
and so Ct < log[27(2t/<5)], showing the result. □ 

4. Applications 

The following lemma, while hardly surprising, is often effective in reducing com- 
putation in specific examples. In particular, it is used in computing the spectral 
profile of the random walk on the n-cycle in the present section. 

Lemma 4.1. Let S — Si U ■ ■ • U Sk be a decomposition of S into connected compo- 
nents. Then 

X(S) = min{A(^)}- 

Proof. Clearly A(S) < mins i {A(S'i)}, and we need only show the reverse inequality 
For a function / > 0, define fs t = ls 4 /- Then 

Var(/) = Var fe/ftl = E E 4 - (E E/s ') " E Var (^)- 
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Consequently, 

\(S) = inf £{fJ) 



fec+(s) Var(/) 

mi 

fec+(s) Var(/) 



> inf 



fec+(s) Var(/) 

and the result follows. □ 
4.1. First Examples. 

4.1.1. The Complete Graph. Consider the continuous-time walk on the complete 
graph in the n-point space = {xi, . . . ,x n } with kernel K(xi,Xj) — 1/n Vi, j. To 
find the eigenvalues of the restricted operator Kg : cq(S) i— ► cq(S), we consider 
functions / : {x\, . . . , X\g\} i— ► R. Since 

1 |S| 

W0=i) = -£/0*) = / 1<J<|5| 

i=l 

/ is an eigenfunction of isTs with corresponding eigenvalue A if and only if Xf(xj) = 
/ for 1 < j < \S\. If A ^ 0, then this implies that / is constant with eigenvalue 
A = \S\/n. So, the smallest eigenvalue of / — K$ satisfies Xq(S) = 1 — \S\/n, and 
the second smallest eigenvalue of / — K satisfies Ai = 1. Since 

Ai<A(S)< Xo{S) 



1 - n(S) 



A(5) = 1 and accordingly A(r) = 1. 

Theorem 11.11 then shows that for the complete graph Too(e) < 21og(n/e). Since 
the distribution of the chain at any time t > is given explicitly by 

(1 - e~*) 

H t (xi,Xj) = e t S Xi (x j ) H 

we see that Too(e) = log[(n — l)/e], and so our estimate is off by a factor of 2. 

4.1.2. The n-Cycle. To bound the spectral profile A(r) for simple random walk 
on the n-cycle, recall (see Lemma 14 - If) that it is sufficient to restrict our at- 
tention to connected subsets. Now consider simple random walk on the n-cycle 
SI = {xq, . . . x n -i} given by kernel K(xi, Xj) = 1/2 if j = i ± 1 (mod n) and zero 
otherwise. By Lemma [4.11 to find \q{S) we need only consider connected subsets 
S C SI. For S such that tt(S) < 1, I — Kg corresponds to the tridiagonal Toeplitz 
matrix with l's along the diagonal and 1 /2's along the upper and lower off-diagonals 
(and 0's everywhere else). In this case, the least eigenvalue is given explicitly by 

A (5) - 1 - c. 



Since the spectral gap satisfies Ai = 1 — cos(27r/n), we have A(r) « l/(rn) 2 for 
1/n < r < 1. Theorem 11.11 then shows the correct 0(n 2 ) mixing time bound. 
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4.2. Log-Sobolev and Nash Inequalities. Logarithmic Sobolev and Nash in- 
equalities are among the strongest tools available to study L 2 convergence rates of 
finite Markov chains. Log-Sobolev inequalities were introduced by Gross |Gro75l 
IGro93| to study Markov semigroups in infinite dimensional settings, and developed 
in the discrete setting by Diaconis and Saloff-Coste DSC96a . Nash inequalities 
were originally formulated to study the decay of the heat kernel in certain parabolic 
equations (see Nas58 ). Building on ideas in CKS87, GS G90b| ICSC 90a , Diaconis 
and Saloff-Coste |DSC96b] show how to apply Nash's argument to finite Markov 
chains. In this section we show that both log-Sobolev and Nash inequalities yield 
bounds on the spectral profile A(r), leading to new proofs of previous mixing time 
estimates in terms of these inequalities. 

Definition 4.1. The log-Sobolev constant p is given by 

. , £(f,f) 

p = ml rr 

Ent^/VO Ent^/^ 

where the entropy Ent^/ 2 ) = ]T f 2 {x) log (f 2 {x)/\\f\\ 2 2 ) n(x). 

xex 



Lemma 4.2 (Log-Sobolev). The spectral profile A(r) and log-Sobolev constant p 
satisfy 

log(l/r) 



A(r) > p- 



Proof. By definition 



A(r) = inf inf ^ > p inf inf E ^ 



7r(S)<r /ec +( S ) Var 7r (/) 7r(S)<r/ ec +( S ) Var^/) 

The lemma will follow if for every set S C X 

inf EnW(.f 2 ) > log^ 



fec+(S) Va r7r (/) 1-tt(5) ' 

Define a probability measure tt'(x) = ^|^y if x S S and tt'{x) — otherwise. Then 

EnW(/ 2 ) _ EnV(/ 2 ) + log 7t L y ^/ 2 

/e4(5) Var»(/) f £$ {S) E^f 2 — tt(S) {E^f) 2 

Rearranging the terms, it suffices to show that 

inf EnV(/ 2 ) > ^g)^gjgT 
/gc+(S) Var T /(/) " l-7r(5) 

However, since ix (S) S (0,1) then 7r (5) 1 °g(iAg)) < i an d S o it suffices that for 
every probability measure and / > that Ent(/ 2 )/Var(/) > 1. This is true, as 
observed in |LO00| and recalled in Remark 6.7 of [BT03| . □ 

The bound Xq(A) > plog(l/7r( J 4)) can be shown similarly, but without need for the 
result of LO00 . Like log-Sobolev inequalities, Nash inequalities also yield bounds 
on the spectral profile: 
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Lemma 4.3 (Nash Inequality). Given a Nash inequality 



\\f\\l +1/D <c 



£<J,f) + ^11/111 



i/ii; 



l/D 



which holds for every function f : X t— > R and some constants C, D, T G R+, </ien 
Proof. The Nash inequality can be rewritten as 

£(fJ) > i f\\fh\ 1/D i 



11/112 " c \\\fhJ t 

Then, 

Ao(A)= inf > inf if#V /Z3 -i 

0V y /ec„(A) H/lll " fec (A) C \\\f\\J T 

1 1 

> 



The final inequality was due to Cauchy- Schwartz: \\f\\\ < \\fW2 \Ar(supp /). The 
lemma follows by minimizing over tt(A) < r. □ 

Although the spectral profile A(r) is controlled by the spectral gap Ai for r > 1/2, 
Nash inequalities tend to be better for r close to 0, and log-Sobolev inequalities for 
intermediate r. Combining Lemmas 14.21 and 14.31 we get the following bounds on 
mixing time: 

Corollary 4.1. Given the spectral gap Ai and the log-Sobolev constant p and/or a 
Nash inequality with DC > T , D > 1 and ir* < l/4e, the L°° mixing time for the 
continuous-time Markov chain with e < 8 satisfies 

2 12 8 

Too(e) < - loglog - — + — log - 
p 47T* Ai e 

2 / 2DC 4 

Too(e) < 4T+— (2Dlog— +log- 

2 (2DC\ 2D 2 



Too(e) < 4T+- loglog — J + AT l0g 7 

Proof. For the first upper bound use the log-Sobolev bound A(r) > plog(l/r) when 
r < 1/2 and the spectral gap bound when r > 1/2. Simple integration gives the 
result. 

For the second upper bound use the Nash bound when r < (T/2DC) 2D and 
spectral gap bound for the remainder. Then 

; (T/2DC)"> 2dr /-4A 2dr 

Toc(e) < 



r T ^ (l - ^) hT/2DC Y ° r Ai 



< 4T + Alog. 4/£ 



Ai b {T/2DC) 2D 

where the second inequality used the bound 1 — ^-rp — > 1 — gjj > 1/2 before 
integrating. Simplification gives the result. 
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For the mixed bound use the Nash bound when r < (T/2DC) 2D , the log-Sobolev 
bound for (T/2DC) 2D < r < 1/2 and the spectral gap bound when r > 1/2. □ 

Similar discrete time bounds follow from Corollarv l2.ll When \fx : K(x,x) > a 
then these bounds are roughly a factor a -1 larger than the continuous time case. 

These bounds compare well with previous results shown through different meth- 
ods. For instance, Aldous and Fill |AF| combine results of Diaconis and Saloff-Coste 
DSC96a, DSC96b to show the continuous time bound 

1 , , fDC\ D 1 



Toc(e) < 2T + - log log ^— 1 + — (4 + log(l/e)) 
whenever DC >T. 

4.2.1. Walks with Moderate Growth. In this section, we describe how estimates 
on the volume growth of a walk give estimates on the spectral profile A(«). The 
treatment given here is analogous to the method of Nash inequalities described in 

H2S35H- 

Define the Cayley graph of (K, tt) to be the undirected graph on the state space 
X with edge set E — {(x,y) : ir(x)K(x,y) + ir(y)K(y, x) > 0}. Let d(x,y) be the 
usual graph distance, and denote the closed ball of radius r around x by B(x, r) — 
{z : d(x, z) < r}. The volume of B(x, r) is given by V(x, r) = J2zeb(x r) 7r ( z )- 

Definition 4.2. For A, d > 1, the finite Markov chain (K,tt) has (A, <i)-moderate 
growth if 

(4.1) V(x, r)>j (~^~) Vx S X, < r < 7 

where 7 is the diameter of the graph. 
For any / and r > 0, set 

V(x,r) s 

Definition 4.3. The finite Markov chain (K, tt) satisfies a local Poincare inequality 
with constant a if for all / and r > 

(4-2) ||/ -f r \\l<ar 2 £ (/,/). 

Under assumptions l|4.1[l and l|4.2|l . Diaconis and Saloff-Coste [DSC96b] derive 
the Nash inequality 

£(f,f) + All/ll 2 



(4.3) \\f\\l +i/d <C 



wy 2 



\f\\t /d 



where C = (1 + l/d) 2 (l + d) 2 / d A 2 l d a~j 2 . By Lemma ET31 this immediately implies 
the lower bound on the spectral profile 

A ^ " ((^+1)2+2/^2/^2/^ " X ) ^2- 

Theorem 14.11 below shows how to bound A(r) in terms of a local Poincare in- 
equality and the volume growth function 

V*(r) = MV(x,r). 
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The proof is similar to the derivation of Nash inequalities for walks with moderate 
growth shown in [DSC96b] . 

Theorem 4.1. Let (K,ir) be a finite Markov chain that satisfies the local Poincare 
inequality with constant a. For v < 1/2, the spectral profile satisfies 

1 

AM > — r 

v ; - AaW 2 (2v) 

where W(v) = inf{r : V*(r) > v}. 

Proof. Fix S C X with n(S) < 1/2 and / € cq(S). It is sufhcient to show that 

£{f,f) > i 



First observe that 



Now, 



/111 '-AaW*(MS)y 



WfWl = (f-frj) + (fr,f) 

< 11/ -M|2 -11/112 + (fr,f)- 



(/-/) = EIv^t E f(yHy))m< x ) 

V ' ' yeB(x,r) 

1 

< 



" K(r)' 

Consequently, by the local Poincare inequality, 




\\ff 2 <^r£(f,f)y 2 \\fh + $pr\\f\ 



Dividing by \\f\\\ and choosing r — W(2tt(S)) we have 

and the result follows. □ 

Corollary 4.2. Let {K,tt) be a finite Markov chain that satisfies (A, d) -moderate 
growth and the local Poincare inequality with constant a. Then the L°° mixing time 
satisfies 

Too(e) < C(a, A, d, e)j 2 

where 7 is the diameter of the graph and C(a, A, d, e) is a constant depending only 
on a, A, d and e. 

Proof. By the moderate growth assumption, W(v) < r y(Av) 1 / d . And so for v < 1/2 



AaW 2 (2v) ~ 8aA 1 / d j 2 v 2 / d ' 

For v > 1/2, note that A(v) > Ai > A(l/2)/2. The result now follows immediately 
from Theorem ll.il □ 
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In Theorem 3.1 of DSC94 Diaconis and Saloff-Coste show that for walks on 
groups with (A, (i)-moderate growth, local Poincare inequality with constant a, 
and 7 > AA d+1 

7 2 

Too(l/e) > 



42^+1^42 " 

It follows that T^l/e) = 0(7 2 ), and Corollary 14.21 was of the correct order 7 2 . 

For instance, consider the example of simple random walk on the n-cycle dis- 
cussed in Section 14.1.21 For this walk V(xi,r) = (1 + 2[r\)/n, and so it satisfies 
the moderate growth criterion (|4.1|l with A = 6, d = 1 and diameter 7 = [n/2\. 
Moreover, it is shown in DSC96b that every group walk satisfies the local Poincare 
inequality 

||./--/ r || 2 .<2|S|r 2 £(/,/) 

where S is a symmetric generating set for the walk. Consequently, Corollary 14.21 
shows that the walk on the n-cycle mixes in 0(n 2 ) time. For several additional 
examples of walks with moderate growth, see |DSC94I IDSC 96b . 

4.3. The Viscek Graphs. For a random walk (K, tt) consider its Cayley graph 
defined in Section 14.2.11 Define the minimum volume of a disk of radius r by 
V*(r) = inf x {V(x,r)}. Here we first use a result of |BCG01| that shows that the 
spectral profile A(r) can be bounded in terms of the volume growth V*(r) alone 
(see Lemma l4~4"jl . We then apply this technique to analyze walks on the fractal- like 
Viscek family of finite graphs. 

Lemma 4.4 f jBCGOlj ). Let = mf x ^ y [n(x)K(x, y) + n(y)K{y,x)] and 

w(r) =inf{fc: > r}. 

Then 

\( A) > 

y ' ~ 4tt(A)w(tt(A)) ' 
Proof. Fix / E cq(A) normalized so that ||/||oo = 1- Then, 

WfWi = £i/(x)| 2 tt(z)<^)- 

x 

Let Xo be a point such that |/(xo)| = 1 and let k — m&x{l € N : B(xq,1) C A}. 
Then there is a sequence of points Xq, x%, . . . , Xk+i with Xi ~ a;»+i, xq, . . . , Xf. G A 
and Xk+i $ A. So, 

£(fJ) = 5£l/(s)-/(v)lM*)*(z>v) 

1 k 

- o ^ \f( Xi +^ ~ /( a; 0| 2 [ 7r ( a; 0- ft: ( a; i> + n(x i+ i)K(x i+ i,Xi)] 

»=0 



> 



2(k + 


2) 


Q* 




2(fc + 


2) 







£l/(*<+i) -/(**)! 

\i=0 / 

\f(x k+1 )-f(xo)\ 2 



2(fc + 2)' 
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X 




Figure 1. The first three generations V4(0), 14(1) and V4(2) of a 
Viscek graph with N = 4. 

Consequently, 

Ao(A)= inf 



fec (A) \\f\\i ~ 2(k + 2)n(A)- 
To finish the proof, observe that n(A) > V(x ,k) > V*(k), and so w(ir(A)) > 
k + 1 > (fc + 2)/2. □ 

The Viscek graphs are a two parameter family of finite trees that are inductively 
defined as follows. Fix the parameter N > 2, and define Vn(0) to be the star graph 
on N + 1 vertices (i.e. a central vertex surrounded by N vertices). Given Vat(tt.— 1) 
choose N vertices xi, . . . ,xn such that d(xi,Xj) = diam(VAr(n — 1)) for i ^ j. 
Construct Vat (n) by taking TV + 1 copies {V l N {n— 1)}^L 01 the (n— l) th generation 
graph, and for 1 < i < N identifying x® £ V%(n— 1) with x\ £ V l N (n— 1). Observe 
that a different choice of vertices Xi, . . . , xm leads to an isomorphic construction. 
For N = 2, V2(n) is a path for each n. Figurenillustrates the first three generations 
of a Viscek graph for N = 4. 

The following lemma bounds the spectral profile and mixing time for simple 
random walk on Vn(ti)- The proof is analogous to the volume growth computation 
for the infinite Viscek graph W(oo) = lim n _>oo Vtv(^) given in BCGQl] and recalled 
in |FBq . 

Lemma 4.5. For N > 2, r < 1 the spectral profile A(r) for simple random walk 
on Vjv(n) satisfies 

where 7 = diam(Vpf(n)) = 2-3" and i/ie constants a. A > depend only on N . 

In particular, there exist constants b, B > depending only on N such that the 
mixing time for the continuous-time walk satisfies 

6(V) 7 d+1 < n(l/e) < ^(Ve) < B(N) 7 d+1 . 
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Observe that since the conductance profile for Vtv(^) satisfies 

1 1 



$(r) 



\E^r(n)\r "/ d r' 

using the conductance profile bound of Corollary 11.11 results in the upper bound 
Too(l/e) di l 2d which overestimates the mixing time for N > 3. 

Proof. We first show that the mixing time bound follows from the spectral profile 
estimate. The upper bound is a direct consequence of Theorem 11.11 Recall that 
for an ergodic chain, the spectral gap Ai and L 1 mixing time are related by 1/Ai < 
Ti(l/e) (see e.g. |SC96| ). Since A(r) > Ai, the lower bound is immediate. 

To estimate the spectral profile, first note that the number of edges \En(ti)\ = 
N(N + 1)". Since V N (n) is a tree, |Vjv(n)| = N(N + l) n + 1. Furthermore, 
diamOV(n)) = 2-3". 

For < k < n, define a /c-block to be a subgraph of V_/v(n) isomorphic to the k th 
generation graph Vjv(fc). Fix x S Vn(u) and 3 < r < diam(Vjv(n)). Then there is a 
unique integer m such that 3 m + 1 < r < 3 m+2 . Moreover, the vertex x is contained 
in some m block B. Since diam(-B) = 2 • 3 m , B(x,r) D B. Consequently, 

\B(x,r)\ > \B\ = N{N + l) m + 1 

and since 7T* = l/(2|£'jv(n)|) 

v . ir)> wpr+±y(L ' 

1 ' - 2N{N + l) n ~ v7 

where d = log 3 (A^+ 1) and the notation a ^ b indicates that there is some constant 
c(N) > depending only on N such that a < c(N)b. Thus, using the notation of 
Lemma IP1 w(s) < ^s 1 / 4 . Since Q» = l/|Sjv(n)| > l/7 d , Lemma 1^1 gives the 
lower bound on the spectral profile. 

For the upper bound we construct test functions f m supported on m-blocks. 
Given an m-block A c Visr(n), choose vertices xi,...,Xn such that d(xi,Xj) = 
diam(A) for i ^ j, and call the shortest paths between these vertices diagonals. 
These diagonals meet in a unique point o at the center of the m-block. Define the 
function f rn e co (A) as follows: Along diagonals, f m varies linearly with f m (p) = 1 
and f m {xi) — 0. Since d(o,Xi) = diam(A)/2 = 3 m , along diagonals the function 
is given explicitly by f m {x) = 1 — d(o, x)/3 m . For a point x off of the diagonals, 
let f m (x) = fm(x') where x' is the closest point to x that lies on a diagonal. 
(See Figure |3 for a graphical representation of f m ). Now, since K(x,u)tt(x) = 
l/(2\E N (n)\) forx~y 

£(fm,f m ) = \fm(x) - f m (y)\ 2 K(x,y)Tr(x) 



m 7 



2\E N (n)\ 



Define the central m - 1 block of A to be A' = {x E A : d(o, x) < 3 m_1 }. Since 
fm(x) > 2/3 on A' 7 

vjl>U*)~ (N+ir 



9 v ' 7 



d 
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FIGURE 2 . A graphical representation of the test function f 2 sup- 
ported on a 2-block. 



It is sufficient to prove the upper bound on A(r) for 1/(N + 1)" 2 < r < 1/2. For 
these r take 

logr(A + l)™- 2 



m(r) = 



< n. 



log TV + 1 

Then (N + l) m(r ^ < r(N + 1)"~ 2 and so for an m(r)-block K, ir(K) < r. Conse- 
quently, for r in this range 

£{.frm.fm) ^ 2£ (f m , fm) 



Var(/ m ) 
Finally, since (iV + l) m >: rj d 

£ ( / m j frn ) 

Var(/ m ) 



ll/mlll 



[3(A+1)]' 



log 3(JV + 1) 

= (iV + i)- m ( 1 + 1 / d ) 

1 



-< 



r 



d+l r l+l/d 



and the upper bound on A(r) follows. 



□ 



4.4. A delicate example. Consider simple random walk on the product group 
Z„ x Z„2. For this model, it is not hard to see that 7 = 0(n 2 ) and that the volume 
satisfies 



VAr) 



(r + iy/n* 0<r<n 
r/n 2 n < r < n 2 
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Taking r = in l|4.f |) shows that walks with moderate growth must have 

n 3 A \n 2 

Consequently, Z„ x Z n 2 is of moderate growth with d = 3/2 and furthermore, this 
is the optimal choice of d (assuming A and d are constant). Corollary 14 . 21 gives the 
correct j 2 — n A mixing time, but gives the underestimate 

. . . C(a,A) 

for the spectral profile. The problem is that the moderate growth criterion alone is 
not sufficient to identify the two different scales of volume growth present in this 
example: For r<l/n the space appears 2-dimensional, while for r 3> 1 /n it looks 
1-dimensional. However, we can apply Theorem 14. II to directly take into account 
volume estimates, leading to sharp bounds on both the spectral profile and the rate 
of decay of d x ^(Il u n). 

Lemma 4.6. For a,b > 2, the walk on G = Z a xZf, with generating set {(±f , 0), (0, ±1)} 
has spectral profile satisfying 

1/vab 1/ab < v < a/b 
A(v) x { l/v 2 b 2 a/b<v<l 
l/b 2 l<v 



In particular, 



, (rr wW j ab/(t + l) 0<t<a 2 

doaA*U*)~ i b/t l/2 a 2 <t<b 2 



and there are constants c\ , C2 > such that for t > b 2 

e' cit/b2 <d 00 AHt^)<e^ t ' h \ 

Proof. Without loss of generality, assume a < b. Then diam(G) = 0(6), and the 
volume function is given by 

VM~I (^+ 1 ) 2 /H) 0<r <a 
V *V>~\ r /b a<r< diam(G) ' 

Consequently, W(v) = inf{r : V*{r) > v} satisfies 

f «V2( a6 )Va l/ab<v<a/b 
1 > ~ \ vb a/b<v<l 

Since this group walk is driven by a constant number of generators, the chain 
satisifes a local Poincare inequality with constant independent of a and b. For v < 
1/2, the lower bound on the A(v) now follows from Theorem 14. f I By Lemma \'2. 21 
^i > A(l/2)/2. For v > 1/2, the lower bound on A(r) then follows from the fact 
A(r)>Ai. 

For the upper bound, for m > 1 define linear functions f m € co(B(0,m)) by 

/ ro (x) = l-^M. 

m 
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Then, 

£(im,/m) = ^r^^ifmix) - f m (y)] 2 K(x,y) 
x -.y 

< v*M 

2m 2 

Since G is volume doubling (i.e. 14 (2m) »s 14 (m)) 

V 2 



||/m||a > (jj F *( m / 2 ) ^( TO ) 
and consequently, for 14 (m) < 1/2 

£{fm, fm) ^ „ ^ (/"H _^ 1 

Var(/ m ) ~ H/mlll ~ m2 ' 

Since it is sufficient to consider only u < 1/2, the upper bound on A(i>) follows by 
taking m = W(v) — 1. 

Following the notation of Theorem 13. II define j(t) implicitly through 



L(r) 



C/rab l/ab<r < a/b 
C/r 2 b 2 a/b<r<\ 



and observe that for C sufficiently large, the walk satisfies the anti-Faber-Krahn 
inequality with L(r) by the upper bound on A(r). Then j(t) is given explicitly as 

[Ct + l)/ab 0<t<(a 2 -l)/C 
(2Ct-a 2 + 2) 1 /2/fo (a 2 -l)/C<t<{b 2 + a 2 ~2)/{2C) ' 



7(*) 



Furthermore, 

_ / C/(Ct + l) 0<i<(a 2 -l)/C 



7 (t) \ C/(2Ct- a 2 + 2) (a 2 -l)/C<t< (b 2 + a 2 - 2) / (2C) ' 

So, j(t) is (5-regular on (0,T) with 5 = 1/6 and T = {b 2 + a 2 - 2)/(2C). For 
i < c6 2 and c sufficiently small, the lower bound on convergence now follows from 
Theorem rm 

Let {Ai} be the eigenvalues of I — K with corresponding real orthonormal eigen- 
functions {ipi}- Since ht{x,y) = H t l y (x), writing l y (-) in this L 2 basis we have 

n—1 n—1 

ht(x, y) = J2 e~ tx *Mx)My) = i + e'^M^Mv)- 

i=0 i=l 

In particular, 

sup h t (x,x) -1 > supe~ tAl -0 2 (a;) > e" tAl 

a; a; 

since the eigenfunctions are normalized in L 2 . Since Ai ~ A(l/2) » 1/6 2 , the lower 
bound on convergence rate follows. 

Now we show the upper bound. By the lower bound on A(r), using the notation 
of Theorem O 

t/ab 0<t<a 2 

V(t) h { tV 2 /b a 2 <t<b 2 . 
e ct/b 2 b 2 < t 
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Consequently, 



sup d% (H t (x,-),n) 

xex 



-< 



ab/t 
b/t 1 / 2 

p -ct/b 2 



0<t<a 2 
a 2 <t<b 2 
b 2 <t 



Since the walk is reversible, the upper bound on doo i7r then follows from the argu- 
ment of Theorem ll.fi □ 

For a <C b, Lemma [4.61 captures the fact that decay is fast at the start of the 
walk and then slows down. More specifically, 

dooAHt^) ~ 1/K(i 1/2 ) * < diam 2 (G). 

As shown in |DSC95| , this relationship holds in general for random walks on groups 
with volume doubling. While we considered only the simple case of Z a x Z&, the 
technique applies well to more general fc-fold products. 
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